A Numerical Algorithm for Zero Counting. 
II: Distance to Ill-posedness and Smoothed Analysis 



Felipe Cucker * 
Dept. of Mathematics 
City University of Hong Kong 
HONG KONG 
e-mail: macucker@cityu.edu.hk 

Gregorio Malajovich-'- 
Depto. de Matematica Aplicada 
Univ. Federal do Rio de Janeiro 
BRASIL 
e-mail: gregorio@ufrj.br 



Teresa Krick ^ 
Departamento de Matematica 
Univ. de Buenos Aires & CONICET 
ARGENTINA 
e-mail: krickSdm.uba. ar 

Mario Wschebor 
Centro de Matematica 
Universidad de la Republica 
URUGUAY 
e-mail: wscheborOcmat . edu . uy 



Abstract. We show a Condition Number Theorem for the condition number of zero 
counting for real polynomial systems. That is, we show that this condition number equals 
the inverse of the normalized distance to the set of ill-posed systems (i.e., those having 
multiple real zeros). As a consequence, a smoothed analysis of this condition number 
follows. 



1 Introduction 



This paper continues the work in [S], where we described a numerical algorithm to count the 
number of zeros in n-dimensional real projective space of a system of n real homogeneous 
polynomials. The algorithm works with finite precision and both its complexity and the 
precision required to ensure correctness are bounded in terms of n, the maximum D of the 
polynomials' degrees, and a condition number 

In this paper we replace «(/) — which was originally defined using the computationally 
friendly infinity norm — for a version (defined in Section [2] below) which uses instead 
Euclidean norms. This difference is of little consequence in complexity estimates since one 
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has (cf. Proposition 13.31 below) that 



(1) 



It allows one, however, to prove a result following a classical theme in conditioning — the 
relation between condition and distance to ill-posedness — and to deduce from this result a 
probabilistic analysis for /?(/) and, a fortiori, for the complexity of the algorithm in [8]. This 
analysis is of interest since, in contrast with n and D, the value of (or of is not 

apparent in / and therefore complexity or accuracy bounds depending on this condition num- 
ber are not of immediate use. A solution pioneered by John von Neumann and collaborators 
(see [m §2.1] and references therein) and reintroduced by Steve Smale [231 |2l] is to assume 
a probability measure on the space of data and to study the condition number at data / as 
a random variable. This approach yields bounds on accuracy or complexity for random data 
and has been pursued in several contexts: systems of linear equations [121 [B]) polyhedral 
conic systems [3l El [9l [151 125] , linear programs [1], complex polynomial systems [21], [18], etc. 
In our case it allows us to trade the presence of k(/) in deterministic bounds for probabilistic 
bounds in n and D only. 

1.1 Distance to Ill-posedness 

It has since long been observed |llj that the condition number for several problems (in its 
original aception, as a measure of the worst possible magnification of small input errors in 
the output [19]) either coincides with the relativized inverse of the distance from the input 
to the set of ill-posed data or is bounded by a small multiple of this inverse. A data is 
ill-posed when the magnification above is unbounded. In our case, a polynomial system / 
is ill-posed when arbitrary small perturbations of / may change its number of projective 
real zeros. Systems having this property are exactly those having multiple (real projective) 
zeros. Let us denote the set of such systems by Sr. Also, let Ti^ denote our set of input 
systems, i.e. the set of / = (/i, . . . , /„), n real homogeneous polynomials in n-|- 1 variables of 
degrees d := (di, . . . , d„) respectively, endowed with the Bombieri-Weyl norm || Ww (defined 
in Section [2|). Finally, let dist be the distance on induced by this norm. 

Our main result in this note is the following. 



Remark 1.2 It is worth noting that, although k(/) is somehow related to the condition 
number fJ-normif) for complex polynomial systems (cf. [Il[20]) a result like Theorem 1 1 . 1 1 does 
not hold for the latter. Actually, such a result holds on the fibers of the zeros (see [IJ Ch. 12, 
Theorem 3]) but not globally. 

We also note that, as a consequence of Theorem [LT] we have /?(/) > 1 for all / G 7id- 
This feature, although not immediate from the definition of k(/), follows immediately from 
the fact that G Sr. 



Theorem 1.1 For all f G Hd, k(/) 



ll/llw- 



dist{/,SK) ■ 
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1.2 Smoothed Analysis 



Theorem 11.11 carries out meaningful consequences in the probabihstic analysis of 

The usual probabilistic analysis for polynomial systems assume that random / are drawn 
from the unit sphere S{Ti.d) with the uniform distribution (or from such a distribution on the 
real projective space induced by this sphere). A different approach to the randomization of 
input data has been recently proposed under the name of smoothed analysis [22j. The idea 
consists on replacing 'random data' by 'random perturbations of given data'. A recent result 
in [2] derives smoothed analysis bounds for condition numbers which can be written as inverses 
to distances to ill-posedness. Because of Theorem ll.il these bounds can be straightforwardly 
applied to 



To state this result we need to introduce some notation. Let PP(M) denote the real 
projective space of dimension p and dp be the projective distance on P^(]R) (i.e. the sinus of 
the Riemannian distance). For a point a £ PP(M) and a G (0, 1] we denote by B{a,a) the 
ball (w.r.t. dp) centered at o and of radius a. That is, 

B{a,a) := {x £ P^'(M) | dp(x,a) < a}. 

In what follows we assume B{a, a) endowed with the uniform probability measure. Note that 
in the particular case cr = 1 we obtain B{a,a) = P^'(M) for each a E P^ and hence, the usual 
probabilsitic analysis referred to above. 

Theorem 1.3 [2j Let S c¥P be contained in a projective hypersurface H of degree at most 
d and ^ : P^ ^ [1, oo] he given by 

nz) = 



d]i>{z,S) 

Then, for all a G (0, 1] and all t>{2d+l)^, 

sup Prob {'tf{z) >t}< 13dp— 

agpp zeB{a,(7) at 

and 

sup E [In^(z)] < 21np+ 21nfi + In ( i ) + 5. □ 

a&P zeB{a,a) \a ^ 



As a consequence of this result we obtain the following corollary (which we prove in Sec- 
tion [3]). Here P('Hd) denotes the projective space associated to Tid, N denotes its dimension 
and T> = di - ■ ■ dn its associated Bezout number. 

Corollary 1.4 For all a G (0, 1] and all t > {AnV"^ + l)f ; 

sup Prob {K{g) >t}< ISn^DPiV— 
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and 

sup E [lnK(fi()] < 21n7V + 41nn + 21ni:> + lnD + ln ( i ) +6. 

f&ind)9eBif,a) \(J J 

In particular, taking a = 1, we obtain average analysis: for all t > N{4nT)^ + 1), 
and 

E [InK(g)] < 21nA^ + 41nn + 21nP + ln(D) +6. 

A recent result [7J extends Theorem 11.31 to absolutely continuous measures on B{a,a) 
whose densities are radially symmetric around a and may have a pole at a. Applications of 
this result to readily follow. 

2 Setting and Notations 

For d G N, 7i(i denotes the subspace of M[xo, • • • ,Xn] of homogeneous polynomials of degree 
d. We endow Tid with the Bombieri-Weyl inner product ( , defined for / = X]|j|=^ Qj^c-' 

and g = E|j|=d^i^^ by 

{f,g)w =^%r 
\j\=d \j) 

where for j = (jq, . . . , jn), (j) := jgi'.j \ - ^ main feature of this inner product is its invariance 
under the action of the orthogonal group 0(n + 1). That is, for aW ip £ 0{n + 1) and all 
f,g e Hd, {foi^,go'tp)w = {f,9)w- Next, for di,. . . ,d„ E N, we endowed = 'Hd^x.- ■x'Hd„ 
with the inner product 

n 

{f,a)w = ^{fu9i)w 

i=l 

where / = (/i, . . . , /„), g = {gi, . . . ,gn) S 7id- We write || \\w and dist to denote the norm 
and distance on TC^ induced by this inner product. 

Projective zeros of polynomial systems / G TCd correspond to pairs of zeros (— C) of 
the restriction f^gn of / to the n-dimensional unit sphere C M""^^. We will thus consider 
a system / E TC^ as a (centrally symmetric) mapping of into M". 

For a point x £ and a system / £ 7id one may define both ill-posedness and condition 
relative to this point. For the first, one defines 

= {/ G Tid I X is a multiple zero of /}, 

the set of systems which are ill-posed at x. Note that ^ for all x G and that 

= {/ G T~(-d I / lias a multiple zero in S^} = Sir(x). 
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Towards the definition of «(/), for / G Hd and x G 5", we define 

^1 



(2) 



where Df{x)\T^S" is the restriction to the tangent space of x at 5" of the derivative of / at 
X and the norm is the spectral norm, i.e. the operator norm with respect to || ||2. 

Next, we define the condition of f relative to x to be 
-.(fx)- ""^""^ 



Finally, we take the condition number of / G Hd to be its condition relative to its worst 
conditioned point, 

«(/) = maxK(/,a;) 

Note that for all A 7^ 0, k(A/) = and dist(A/,EM) = | A|dist(/, Er). The same is 
true relative to a point x G S"*. We will therefore assume, without loss of generality, that 
|vK = 1, and denote by S{Hd) the unit sphere in Hd- 



3 The proofs 



3.1 The main results 



Proposition 3.1 For all x e and f G S{Hd), 

K{f,x) 



1 



dist(/,SM(x))- 



Proof. For < z < n, let = (0, . . . , 0, 1, 0, . . . , 0) denote the ith coordinate vector. The 
group 0(n+l) acts on Hd x S"" and leaves //norm, ^ and distance to T,^{ ) invariant. Therefore, 
we may assume without loss of generality that x = cq. This implies that T^^S^ ~ (ei, . . . , e„) 
and we may write the singular value decomposition 



diag 



Ur, 



0-1 



an 



u 



with U and V orthogonal and o"i > (T2 > • • • > o"n > 0. Since the subgroup of 0{n + l) leaving 
eo invariant is isomorphic to 0(n) acting on T^^S^ we may as well assume that V = Id. Note 
that /Inorm(/,eo) = a'^ , and therefore K(/,eo) = (a^ + ||/(eo)||2)~'^'^^- 
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Let gi{x) := fi{x) - /i(eo)xo' - ^/d'ianUinXQ^ ^x„, where Un = (tiin, • • .,Unnf- Clearly, 
dii^o) = and Dgi{eo)en = (here and in the sequel we denote Dgi{eo)\TeQS" simply by 
Dgi{eo)) since %/9x„(eo) = dfi/dxnieo) - y/dluinCTn = 0. Thus, g = {gi, ...,gn) e SM(eo). 
Moreover 



Wfi - 9i\\w = fiieof + Q ^ iVdiC^nUinf = |/i(eo)|^ + cr'^u 



2 , _2„.2 
in 



and hence, using \\un\\ = 1, 



It follows that 



\\f-g\\w = \\fieo)\\l + 'T'n = m,eor'- 
dist(/, SK(eo)) < 11/ - ffllH/ = eo)-^ 



For the reciprocal, let g G SiR(eo). Then, g{eo) = and Dg{eo) is singular. We want to 
show that 11/ — g\\\Y > k(/, eo)~^. To this end, we write 

dfi dfi 

fi{x) = /j(eo)V + ^(^o)a;o' "^^1 ^ 'Q^i^Q)^0~^^ri + Qi{x) 



with deg^ Qi < di — 2 and, similarly. 



dxi 



Then 



and 



11/^ - 5*11^- > l/*(eo)P + J.\\Dfi{eo) - Dg,{eo)\\l 



5ll^>ll/(eo)|li + 



diag ( 4^] Dfieo) - diag ( ] Dg{eo) 



We know that diag ( ) Dg{eo) is singular. Hence, denoting by Sing„ the set of singular 



n X n matrices and by distj? the Frobenius distance on this set of matrices, 
distF (diag ( Df{eo),d\ag ( ^ ) Dg{eo)] > dlstp (diag ( ^/(eo), Sing„ ) = an, 



□ 



, /dj \VdiJ J V \Vdi 

the last by the Eckart- Young Theorem p], §11.1]. It follows that 

||/-<7||^>||/(eo)||i + ^^ = ^5(/,eo)-'. 



Proof of Theorem II. 1[ Again we can assume / G SiTid)- Note that 
dist(/, Sr) = min dist(/,5r) = min dist(/, 



min dist(/,SK(x)) dist(/, Sm) ' 

x&S" 

Before proving Corollary 11.41 we recall some useful facts in algebraic geometry. 

For 1 < i < n, let fi = ^y^^^. UijX^ be a generic (i.e. with indeterminate coefficients) 
homogeneous polynomial of degree di in the variables x = (xq, • • • , x„) and / = (/i, . . . , /„). 
Set N := Y17=i ('^'n") ~ dimension of the projective coefficients space. The d- 

discriminant variety Sc C P^(C) is the locus of such polynomial systems / = (/i, . . . ,/„) 
with multiple zeros, i.e. such that there exists z E C"^-^, z 0, with fi{z) = ■ ■ ■ = fn{z) = 
and Df{z) has rank < n. It is well-known that Sc is a hypersurface in P^(C) defined by 
an irreducible polynomial Disc(/) G ^[uij] (see [IT] or [161 Ch. 10]). For lack of a precise 
reference we prove the following result. 



since Sr = IJxeS" ^r{x)- Then, 

1 



k(/) = maxK{f,x) = max 



xeS" ' xeS" dist(/, I;r(x)) 



Lemma 3.2 

n ^ 

deg(Sc) = nV + {di + ■ ■ ■ + dn - n - l)vY,-r- 

1 "'i 
j=i J 



Proof. We know that deg(Sc) = deg(Disc(/)). We apply Identity (13) of [TO] : 

ReSp,rfi,...,rf„(J/,/i, ...,/„)= Resrfi,,..,rf„(/{', . . . , /°)Disc(/), 

where the standard notation Res^j^...^^^ is for the multihomogeneous projective resultant of 
n generic homogeneous polynomials of respective degrees di,...,dn in n variables, p := 
di + ■■■ + dn — n, J f is the determinant of the matrix {dfi/dxj)i<ij<n and denotes the 
homogeneous component (of degree di) of /j(l, xi, . . . , x„). 

We note that deg(J/) is a polynomial of degree pin x whose coefficients are polynomials 
in Uij of degree n. On the other hand Res^^^...^^^ is a multihomogeneous polynomial of degree 
Y\k=/=idk group of variables Uij ([6]). Therefore, since J/ has degree n in the Uij, we 

derive 

nV + p -= ^+deg(Disc(/)). 
The statement easily follows. □ 



Proof of Corollary 11.41 We have Sk C Sc and for ah / e Wd, / / 0, 

dist(/,SM) dp(/,SMnP^(M))- 

The statement now follows from Theorem [L3] with p = N, S = SiRnP^(M), H = ScnP^(M), 
and d = n'^'DV, the last by Lemma [32] since nV + {p - 1)P Yl]=i i" ^ "-^DP. □ 
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3.2 The equivalence of condition numbers 



In [8] we considered TC^ endowed with the norm given by 

11/11 := max \\fi\\w 

l<t<n 

(we note ||/|| < < V^\\f\\) and defined 

:= rnaxmin |/inorm(/,a;), 

with 



\\fix)\\c 



fJ-noTmif,x):=Vn\\f\\ Df{x)J ^„d\ag{y/di) = — /inorm(/, a;) 



(we note /^norm(/) 2^) < ^norm(/) 3:^) ^ \/n finormif , x)) . The next result shows that the k(/) 
thus defined is closely related to k(/). 



Proposition 3.3 



n 



< Kif) < V2n Kif) 



Proof. Let x e 5". We observe that since ||/||^//norm(/, a;) ^ + ||/(a;)||i > 
\\f\\wV'norm{f,x)~'^, we have k(/,x) < Jlnormif,x) < finoTm{f,x). Similarly, using WfWw < 
V^WfW and ||/(x)||2 > ||/(x)||oo, we have k(/,x) < V^||/||/||/(x)||oo. Therefore 



K'{f,x) < ^/nmin\ finormif,x), 



\\fix)\\c 



which implies 



k(/) = maxK(/,x) < ^/n ma-Kuiin I fJ.norm{f,x) 



ll/(^)l|c 



nK{f). 



To prove the other inequality note that, for any x € S'^, 

2 11^112 



mm 



L'/(x)|y^^„diag(v^ 



' 11/(^)111 



< 



< 



Z)/(:E)p^;^„diag(^) +||/(x)||i 



mw'w 



\lvJlnoUf,x)-^ + \\fix)\\l 

2K{f,x)\ 



and ||/(x)||2 < \/n ||/(a;)||oo- Therefore, 

min|;Unorm(/,a;) 
This implies k(/) < -v/2nK(/). 



Il/(^)l|c 



< V2nK{f,x) 



□ 
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